Goto

Collaborating Authors

 dual adversarial semantic-consistent network


Dual Adversarial Semantics-Consistent Network for Generalized Zero-Shot Learning

Neural Information Processing Systems

Generalized zero-shot learning (GZSL) is a challenging class of vision and knowledge transfer problems in which both seen and unseen classes appear during testing. Existing GZSL approaches either suffer from semantic loss and discard discriminative information at the embedding stage, or cannot guarantee the visual-semantic interactions. To address these limitations, we propose a Dual Adversarial Semantics-Consistent Network (referred to as DASCN), which learns both primal and dual Generative Adversarial Networks (GANs) in a unified framework for GZSL. In DASCN, the primal GAN learns to synthesize inter-class discriminative and semantics-preserving visual features from both the semantic representations of seen/unseen classes and the ones reconstructed by the dual GAN. The dual GAN enforces the synthetic visual features to represent prior semantic knowledge well via semantics-consistent adversarial learning. To the best of our knowledge, this is the first work that employs a novel dual-GAN mechanism for GZSL. Extensive experiments show that our approach achieves significant improvements over the state-of-the-art approaches.


Reviews: Dual Adversarial Semantics-Consistent Network for Generalized Zero-Shot Learning

Neural Information Processing Systems

The primary contribution of the paper is the dual-GAN structure with semantics-consistency and visual-consistency loss. The paper has novel components (although it is close to [7], see below). The paper shows that their model performs better than the existing methods on GZSL benchmark datasets. To better understand the individual contribution of these losses, the paper gives an ablation study in Table 3. However, it should also include results of ablation study on CUB and SUN datasets in the main paper.


Reviews: Dual Adversarial Semantics-Consistent Network for Generalized Zero-Shot Learning

Neural Information Processing Systems

The paper received all accept recommendations and the AC agrees with the recommendation. The authors are requested to revise the paper with the additional clarifications wrt to [7], results from the new experiments on FLO, and discussion points.


Dual Adversarial Semantics-Consistent Network for Generalized Zero-Shot Learning

Neural Information Processing Systems

Generalized zero-shot learning (GZSL) is a challenging class of vision and knowledge transfer problems in which both seen and unseen classes appear during testing. Existing GZSL approaches either suffer from semantic loss and discard discriminative information at the embedding stage, or cannot guarantee the visual-semantic interactions. To address these limitations, we propose a Dual Adversarial Semantics-Consistent Network (referred to as DASCN), which learns both primal and dual Generative Adversarial Networks (GANs) in a unified framework for GZSL. In DASCN, the primal GAN learns to synthesize inter-class discriminative and semantics-preserving visual features from both the semantic representations of seen/unseen classes and the ones reconstructed by the dual GAN. The dual GAN enforces the synthetic visual features to represent prior semantic knowledge well via semantics-consistent adversarial learning.


Dual Adversarial Semantics-Consistent Network for Generalized Zero-Shot Learning

Ni, Jian, Zhang, Shanghang, Xie, Haiyong

Neural Information Processing Systems

Generalized zero-shot learning (GZSL) is a challenging class of vision and knowledge transfer problems in which both seen and unseen classes appear during testing. Existing GZSL approaches either suffer from semantic loss and discard discriminative information at the embedding stage, or cannot guarantee the visual-semantic interactions. To address these limitations, we propose a Dual Adversarial Semantics-Consistent Network (referred to as DASCN), which learns both primal and dual Generative Adversarial Networks (GANs) in a unified framework for GZSL. In DASCN, the primal GAN learns to synthesize inter-class discriminative and semantics-preserving visual features from both the semantic representations of seen/unseen classes and the ones reconstructed by the dual GAN. The dual GAN enforces the synthetic visual features to represent prior semantic knowledge well via semantics-consistent adversarial learning.